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CRYSTAL STRUCTURE OF STAPHYLOCOCCUS UNDECAPRENYL 
PYROPHOSPHATE SYNTHASE AND USES THEREOF 

Field of the Invention 

5 The invention is directed generally to the crystal structure of enzymes. More 

particularly, the invention relates to the atomic structure of the substrate-binding sites 
of enzymes involved in the chain elongation of isoprenoid chains and the use of the 
structure in drug design. 

10 Reference to Related Application 

This application claims priority to U.S. Provisional Patent Application 
60/419,952, filed on October 21, 2002. 

Sequence Listing Reference 
15 This application includes one computer sequence listing (SEQ. ID NOS. 1 to 

20) appendix submitted on one computer diskette and the material on the computer 
diskette sequence listing (SEQ. ID NOS. 1 to 20) is incorporated-by-reference into 
the present application. 

20 Background of the Invention 

Prenyltransferases are enzymes important in lipid, peptidoglycan, and 
glycoprotein biosynthesis. These enzymes act on molecules having a five-carbon 
isoprenoid substrate. Prenyltransferases are classified into two major subgroups 
according to whether they catalyze the cis- or frans-isomerization of products in the 

25 prenyl chain elongation. E-type prenyltransferases catalyze frans-isomerization and 
z-type prenyltransferases catalyze c/s-isomerization. Unlike the trans-type 
prenyltransferases, the c/s-prenyltransferases are poorly categorized. In particular, 
little is known about the detailed molecular structure of the active site of c/s- 
prenyltransferases. In consequence, inhibitors of the c/s-prenyltransferases have 

30 been difficult to establish using a structure-based approach. This deficiency is 
particularly important because c/s-prenyltransferases are involved in the biosynthesis 
of peptidoglycan in prokaryotes and that of glycoproteins in eukaryotes. Such 
pathways are crucial for survival of the organism. 
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Bacterial undecaprenyl pyrophosphate synthase (UPS), also known as 
undecaprenyl diphosphate synthase, is a z-type prenyltransferase that catalyzes the 
sequential condensation of eight molecules of isoprenyl pyrophosphate (IPP) with 
trans, frans-farnesyl pyrophosphate (FPP) to produce the 55-carbon molecule termed 
5 undecaprenyl pyrophosphate. Undecaprenyl pyrophosphate is released from the 
synthase and dephosphorylated to form undecaprenyl phosphate that serves as the 
essential carbohydrate and lipid carrier in bacterial cell wall and lipopolysaccharide 
biosynthesis. Undecaprenyl pyrophosphate synthase differs from other members of 
the prenyltransferase family in the product stereochemistry and product chain length. 

10 Emerging resistance to currently used antibacterial agents has generated an 

urgent need for antibiotics acting by different mechanisms. Undecaprenyl 
pyrophosphate synthase exists ubiquitously in bacteria and plays an essential and 
critical roll in the cell wall biosynthesis pathway. Thus, undecaprenyl pyrophosphate 
synthase is essential for cell viability and provides a valid and unexploited molecular 

15 target for antibacterial drug discovery. In consequence, a structure-based approach 
to development of inhibitors could provide novel antibiotics. 

Fujihashi et al., 2001, show atomic coordinates for a crystal of undecaprenyl 
pyrophosphate from M. luteus, in the absence of substrate or cofactor. Fujihashi et 
al., 2001, Crystal structure of cis-preny/ chain elongating enzyme, undecaprenyl 

20 diphosphate synthase, 98 Proc. Natl. Acad. Sci USA 4337. Some amino acid 
residues are not defined in the crystal structure. 

Ko et al., 2001, show atomic coordinates for a crystal of undecaprenyl 
pyrophosphate from E. coli, in the absence of substrate or cofactor. Ko et al., 2001, 
Mechanism of Product Chain Length Determination and the Role of a Flexible Loop in 

25 E. coli Undecaprenyl-pyrophosphate Synthase Catalysis, 276 J. Biol. Chem. 47474. 
Some amino acid residues are not defined in the crystal structure. 

Huang et al., U.S. Patent No. 6,287,810 is directed to polynucleotides 
encoding an undecaprenyl pyrophosphate synthase of S. aureus, but does not teach 
a three-dimensional structure of undecaprenyl pyrophosphate synthase. 

30 

Summary of the Invention 
The invention relates generally to protein crystal structures and uses thereof 
in drug design. More particularly, the invention relates to Staphylococcus 
undecaprenyl pyrophosphate synthase in crystalline form. The invention also relates 
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to a composition comprising the synthase in crystalline form. The composition can 
further comprise at least one ligand. 

In one embodiment, the invention comprises a composition comprising a 
Staphylococcus undecaprenyl pyrophosphate synthase in crystalline form, the 
5 synthase comprising an amino acid sequence at least about 80% homologous to 
SEQ ID NO:1. In a preferred embodiment of the synthase the amino acid sequence 
is at least about 90% homologous. 

The synthase can have a first ligand binding site, a second ligand binding site, 
or both. Moreover, the composition comprising the synthase can comprise at least 

10 one ligand. The ligand can be co-crystallized with the synthase. Suitable ligands 
include, but are not limited to, farnesyl pyrophosphate, (S)-farnesyl 
thiopyrophosphate, isoprenyl phyrophosphate, magnesium ion, and sulfate ion. 
Preferably farnesyl pyrophosphate or (S)-farnesyl thiopyrophosphate are associated 
with the first ligand binding site and isoprenyl pyrophosphate or sulfate are associated 

15 with the second ligand binding site. 

In another aspect the undecaprenyl pyrophosphate synthase comprises a first 
ligand binding site defined by at least one amino acid residue selected from the group 
consisting of Asp 33 , Gly 34 , Gly 36 , Arg 37 , Arg 46 , Ala 76 , Arg 84 , Leu 95 , Pro 96 , and Phe 148 . In 
a preferred embodiment, the crystal can comprise a first ligand binding site defined 

20 by amino acid residues 33, 34, 36, 37, 46, 76, 84, 95, 96, and 148 having atoms 
having atomic coordinates according to Figure 5. 

In yet another aspect, the undecaprenyl pyrophosphate synthase comprises a 
second binding site formed by at least one amino acid residue selected from the 
group consisting of Asp 33 , Arg 201 , Arg 207 , and Set 209 from one chain (A) of the dimer, 

25 and Glu 220 and Gly 251 from the other chain (B) of the dimer. Thus, both polypeptide 
chains can contribute to the second binding site. In a preferred embodiment, the 
crystal can comprise a second ligand binding site defined by amino acid residues 33, 
201, 207, 209, 220(B), and 251(B) having atoms having atomic coordinates 
according to Figure 5. 

30 The invention also relates to undecaprenyl pyrophosphate synthase in 

crystalline form wherein the synthase is S. aureus undecaprenyl pyrophosphate 
synthase. 
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According to the present invention, also provided is a selenomethionine 
substitution crystalline form of a Staphylococcus undecaprenyl pyrophosphate 
synthase. The synthase can be from S. aureus. 

In still another aspect, the invention is directed to a composition comprising 
5 undecaprenyl pyrophosphate synthase in crystalline form and a substrate. The 
synthase can be from any organism, not limited to Staphylococcus. 

One aspect of the invention is directed to a method of designing or identifying 
a potential ligand for an undecaprenyl pyrophosphate synthase comprising using a 
three-dimensional structure of an undecaprenyl pyrophosphate synthase, employing 
10 the three dimensional structure to design or select the potential ligand, obtaining the 
potential ligand; and contacting the potential ligand with the undecaprenyl 
pyrophosphate synthase to determine binding to the undecaprenyl pyrophosphate 
synthase. One skilled in the art will recognize that the steps of the method can be 
carried out in various orders. The three-dimensional structure of a binding site can be 
15 defined by atomic coordinates of amino acid residues 33, 34, 36, 46, 76, 84, 95, 96, 
and 148 according to Figure 5. 

The method can further comprise identifying chemical entities or fragments 
thereof, capable of binding to the undecaprenyl pyrophosphate synthase; and 
assembling the identified chemical entities or fragments thereof into a single molecule 
20 to provide the structure of the potential ligand. 

The potential ligand can be an inhibitor. In one embodiment the inhibitor is a 
competitive inhibitor. In another embodiment the inhibitor is a non-competitive 
inhibitor. The ligand can be designed ofe novo. Alternatively, the ligand can be 
designed from a known inhibitor. The method can further comprise using the atomic 
25 coordinates according to Figure 5, or portion thereof, of a ligand bound to the 
undecaprenyl pyrophosphate synthase. 

Another aspect of the invention is directed to a method for identifying a 
potential inhibitor of a mutant undecaprenyl pyrophosphate synthase, the method 
comprising using a three-dimensional structure of undecaprenyl pyrophosphate 
30 synthase as defined by atomic coordinates of undecaprenyl pyrophosphate synthase 
according to Figure 5; replacing one or more undecaprenyl pyrophosphate synthase 
amino acids selected from 33, 34, 36, 37, 46, 76, 84, 95, 96, 148, 201, 207, 209, 220, 
and 251 of SEQ ID NO:1 in the three-dimensional structure with a different naturally 
occurring amino acid, thereby forming a mutant undecaprenyl pyrophosphate 
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synthase; employing the three-dimensional structure to design or select the potential 
inhibitor; synthesizing the potential inhibitor; and contacting the potential inhibitor with 
the mutant undecaprenyl pyrophosphate synthase or the undecaprenyl 
pyrophosphate synthase in the presence of a substrate to test the ability of the 
5 potential inhibitor to inhibit the undecaprenyl pyrophosphate synthase or the mutant 
undecaprenyl pyrophosphate synthase. The potential inhibitor can be selected from 
a database. 

In another aspect, the invention is directed to a method for identifying a 
potential inhibitor for an undecaprenyl pyrophosphate synthase, comprising using a 

10 three-dimensional structure of the synthase as defined by atomic coordinates of 
undecaprenyl pyrophosphate synthase according to Figure 5; employing said three- 
dimensional structure to design or select the potential inhibitor; synthesizing the 
potential inhibitor; and contacting the potential inhibitor with the synthase in the 
presence of a substrate to determine the ability of the potential inhibitor to inhibit the 

15 synthase. 

In one embodiment, the three-dimensional structure can be further defined by 
atomic coordinates of amino acid residues 201, 207, and 209 according to Figure 5. 
In another embodiment, the three-dimensional structure can be further defined by 
atomic coordinates of amino acid residues 220(B) and 251(B), according to Figure 5. 
20 Amino acid residues labeled "B" are from the complementary polypeptide chain of 
the dimer. 

The potential ligand can be designed to form a hydrogen bond with at least 
one amino acid residue selected from the group consisting of Gly 34 , Gly 36 , Arg 37 , 
Arg 46 , and Arg 84 . In addition, or alternatively, the potential ligand can be designed to 
25 form a hydrogen bond with at least one amino acid residue selected from the group 
consisting of Arg 201 , Arg 207 , Ser 209 , Glu 220 (B), and Gly 251 (B). In another embodiment, 
the potential ligand can be designed to form a hydrophobic bond with at least one 
amino acid residue selected from the group consisting of Ala 76 , Leu 95 , Pro 96 , and 
Phe 143 . 

30 In one aspect the invention is directed to a ligand identified by these methods. 

The invention also relates to a method of identifying a ligand capable of 
binding to an undecaprenyl pyrophosphate synthase substrate binding site, 
comprising: (a) introducing into a suitable computer program information defining the 
binding site comprising first atomic coordinates of amino acids capable of binding to a 



-6- 



synthase substrate, wherein the program displays the three-dimensional structure of 
the binding site; (b) creating a three dimensional model of a test compound in the 
computer program; (c) docking the model of the test compound to the structure of the 
binding site; (d) creating a second three dimensional model of the substrate or an 
5 inhibitor of the synthase and docking the second model thereto; and (e) comparing 
the docking of the test compound and of the substrate or an inhibitor of the synthase 
to provide an output of the program. In one embodiment, the method further 
comprises introducing into the computer program second atomic coordinates of water 
molecules bound to the substrate. In another embodiment, the method further 

10 comprising introducing into the computer program third atomic coordinates of at least 
one synthase structural element selected from the group consisting of an alpha helix, 
a 3 10 helix, a strand of beta sheet, and a coil. The 3 10 helix can comprise the amino 
acid residue sequence Asn Trp Ser. 

In yet another embodiment the method further comprises: (f) incorporating 

15 the test compound into a biological or biochemical assay for synthase activity; and (g) 
determining whether the test compound inhibits synthase activity in the assay. 

The invention is also directed to a method of drug design comprising using 
the atomic coordinates of an S. aureus undecaprenyl pyrophosphate synthase, or 
substantial portion thereof, having at least one ligand binding site, to computationally 

20 evaluate relative associations of chemical entities with the ligand binding site. The 
chemical entity can be an intermediate in a farnesyl pyrophosphate elongation 
reaction, or an analog thereof. 

In another aspect the invention is directed to a method for solving a crystal 
form comprising using the atomic coordinates of S. aureus undecaprenyl 

25 pyrophosphate synthase crystal, or portions thereof, to solve a crystal form of a 
mutant, homolog or co-complex of the undecaprenyl pyrophosphate synthase by 
Molecular Replacement. The method can further comprise using the atomic 
coordinates of a ligand bound to undecaprenyl pyrophosphate synthase. 

One aspect of the invention is directed to a machine-readable data storage 

30 medium comprising a data storage material encoded with machine-readable data 
comprising atomic coordinates comprising amino acid residues 33, 34, 36, 37, 46, 76, 
84, 95, and 148 according to Figure 5. In one embodiment, the machine-readable 
data further comprise atomic coordinates comprising at least one amino acid residue 
selected from the group consisting of 201, 207, 209, 220(B), and 251(B) according to 
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Figure 5. In another embodiment, the machine-readable data comprise the three- 
dimensional structure of S. aureus undecaprenyl pyrophosphate synthase. 

In another aspect, the invention comprises a computer-implemented tool for 
design of a drug, comprising: (a) a three-dimensional structure of a undecaprenyl 
5 pyrophosphate synthase as defined by atomic coordinates of a S. aureus 
undecaprenyl pyrophosphate synthase having at least one ligand binding site; (b) a 
model of a chemical entity; and (c) a computer program addressing the coordinates 
and capable of modeling the chemical entity in the ligand binding site to produce an 
output. 

10 In yet another aspect, the invention comprises a computer for producing a 

three-dimensional representation of a undecaprenyl pyrophosphate synthase ligand 
binding site comprising: (a) a machine-readable data storage medium comprising a 
data storage material encoded with machine-readable data comprising the atomic 
coordinates comprising the amino acid residues 33, 34, 36, 37, 46, 76, 84, 95, and 

15 148 according to Figure 5; (b) a working memory for storing instructions for 
processing the machine-readable data; (c) a central-processing unit coupled to the 
working memory and to the machine-readable data storage medium for processing 
the machine readable data into the three-dimensional representation; and (d) a 
display coupled to the central-processing unit for displaying the three-dimensional 

20 representation. The computer can also produce a three-dimensional representation 
of the ligand binding site of an undecaprenyl pyrophosphate synthase; and the 
machine-readable data can comprise the atomic coordinates of the ligand binding 
site. 

25 Brief Description of the Figures 

Figure 1 is a topology diagram of S. aureus undecaprenyl pyrophosphate 
synthase. 

Figures 2a and 2b are ribbon diagrams of the crystal structure of S. aureus 
undecaprenyl pyrophosphate synthase. Two orthogonal views of the dimer are 
30 shown in Figures 2a and 2b. In each figure, helices are labeled H and beta strands 
S. The ligands, FPP, Mg and Sulfate, are also labeled. 

Figure 3 is a ball and stick diagram of a part of the active site of S. aureus 
undecaprenyl pyrophosphate synthase showing all the interactions between protein 
and bound FPP and Mg. 
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Figure 4 is a ball and stick diagram of another part of the active site S. aureus 
undecaprenyl pyrophosphate synthase showing all the interactions between protein 
and bound sulfate. 

Figure 5 shows the atomic coordinates of the polypeptide chains of S. aureus 
5 undecaprenyl pyrophosphate synthase. 

Detailed Description of the Invention 
In order that the invention described herein may be more fully understood, the 
following detailed description is set forth. The following table lists the amino acid 
10 abbreviations used herein. 



A=Ala=Alanine 


T=Thr=Threonine 


V=Val=Valine 


C=Cys=Cysteine 


L=Leu=Leucine 


Y=Tyr=Tyrosine 


l=lle=lsoleucine 


N=Asn=Asparagine 


P=Pro=Proline 


Q=Gln=Glutamine 


F=Phe=Phenylalanine 


D=Asp=Aspartic Acid 


W=Trp=Tryptophan 


E=Glu=Glutamic Acid 


M=Met=Methionine 


K=Lys=Lysine 


G=Gly=Glycine 


R=Arg=Arginine 


S=Ser=Serine 


H=His=Histidine 



The following terms are used herein as follows, unless stated otherwise: 
The term "naturally occurring amino acids" means the L-isomers of the 
naturally occurring amino acids. The naturally occurring amino acids are glycine, 
15 alanine, valine, leucine, isoleucine, serine, methionine, threonine, phenylalanine, 
tyrosine, tryptophan, cysteine, proline, histidine, aspartic acid, asparagine, glutamic 
acid, glutamine, K-carboxyglutamic acid, arginine, ornithine and lysine. Unless 
specifically indicated, all amino acids referred to in this application are in the L-form. 

The term "unnatural amino acids" means amino acids that are not naturally 
20 found in proteins. Examples of unnatural amino acids used herein, include 
selenocysteine and selenomethionine. In addition, unnatural amino acids include D- 
phenylalanine and the D or L forms of nor-leucine, para-nitrophenylalanine, 
homophenylalanine, para-fluorophenylalanine, 3-amino-2-benzylpropionic acid, and 
homoarginine. 
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The term "positively charged amino acid" includes any naturally occurring or 
unnatural amino acid having a positively charged side chain under normal 
physiological conditions. Examples of positively charged naturally occurring amino 
acids are arginine, lysine and histidine. 
5 The term "negatively charged amino acid" includes any naturally occurring or 

unnatural amino acid having a negatively charged side chain under normal 
physiological conditions. Examples of negatively charged naturally occurring amino 
acids are aspartic acid and glutamic acid. 

The term "hydrophobic amino acid" means any amino acid having an 
10 uncharged, nonpolar side chain that is relatively insoluble in water. Examples of 
naturally occurring hydrophobic amino acids are alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan and methionine. Histidine and tyrosine can also 
participate in hydrophobic bonds. 

The term "hydrophilic amino acid" means any amino acid having an 
15 uncharged, polar side chain that is relatively soluble in water. Examples of naturally 
occurring hydrophilic amino acids are serine, threonine, tyrosine, asparagine, 
glutamine, and cysteine. 

The term "hydrogen bond" is used to describe an interaction between polar 
atoms including N, O, and S, in which hydrogen forms a bridge. The side chains of 
20 ionic and hydrophilic amino acids and of amide moieties in the peptide backbone are 
candidates for hydrogen bonds. Polar and ionic moieties in substrates and inhibitors 
are candidates for hydrogen bonding. 

The term "hydrophobic bond" is used to describe a Van der Waals interaction 
of non-polar moieties that are enthalpicly or entropicly favored over interaction with 
25 water or polar groups. Thus, one model for hydrophobic bonds is the gain in free 
energy formed by exclusion of water. Prime candidates for forming hydrophobic 
bonds are the aliphatic tail of farnesyl pyrophosphate and side chains of amino acid 
residues including phenylalanine, tryptophan, proline, leucine, isoleucine, valine, 
alanine, histidine, and tyrosine. 
30 The term "residue" in amino acid residue refers to the part of an amino acid 

incorporated into a polypeptide. 

The term "ligand" refers to a chemical entity that binds to, or associates with, 
a synthase. Often, but not always, a ligand is a small molecule. A substrate is a 
ligand that can be, under appropriate conditions, chemically acted upon by the 
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synthase. In particular, farnesyl pyrophosphate is a substrate that binds to the 
synthase in the presence of magnesium ion, acting as a cofactor, but does not 
undergo a chemical reaction unless a second substrate, that is isoprenyl 
pyrophosphate, is present, and other conditions necessary for catalysis are met. 
5 The term "mutant" refers to an undecaprenyl pyrophosphate synthase 

polypeptide, i.e. a polypeptide displaying the biological activity of wild-type, 
undecaprenyl pyrophosphate synthase, characterized by the replacement of at least 
one amino acid from the wild-type, undecaprenyl pyrophosphate synthase sequence 
according to SEQ ID NO:1. Such a mutant may be prepared, for example, by 
10 expression of undecaprenyl pyrophosphate synthase cDNA previously altered in its 
coding sequence by oligonucleotide-directed mutagenesis, or other means well- 
known in the art. 

Undecaprenyl pyrophosphate synthase mutants may also be generated by 
site-specific incorporation of unnatural amino acids into undecaprenyl pyrophosphate 

15 synthase proteins using the general biosynthetic method of Noren, C. J., et al., 
Science, 244, pp. 182-188 (1989). In this method, the codon encoding the amino 
acid of interest in wild-type undecaprenyl pyrophosphate synthase is replaced by a 
"blank" nonsense codon, TAG, using oligonucleotide-directed mutagenesis. A 
suppressor tRNA directed against this codon is then chemically aminoacylated in 

20 vitro with the desired unnatural amino acid. The aminoacylated tRNA is then added 
to an in vitro translation system to yield a mutant undecaprenyl pyrophosphate 
synthase enzyme with the site-specific incorporated unnatural amino acid. 

Selenocysteine or selenomethionine may be incorporated into wild-type or 
mutant undecaprenyl pyrophosphate synthase as described below. In this method, 

25 the wild-type or mutagenized undecaprenyl pyrophosphate synthase cDNA may be 
expressed in a host organism on a growth medium depleted of either natural cysteine 
or methionine (or both) but enriched in selenocysteine or selenomethionine (or both). 

Altered surface charge describes a change in one or more of the charge units 
of a mutant polypeptide, at physiological pH, as compared to wild-type undecaprenyl 

30 pyrophosphate synthase. This is preferably achieved by mutation of at least one 
amino acid of wild-type undecaprenyl pyrophosphate synthase to an amino acid 
comprising a side chain with a different charge at physiological pH than the original 
wild-type side chain. 
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The change in surface charge is determined by measuring the isoelectric 
point (pi) of the polypeptide molecule containing the substituted amino acid and 
comparing it to the isoelectric point of the wild-type undecaprenyl pyrophosphate 
synthase molecule. 

5 Altered substrate specificity refers to a change in the ability of a mutant 

undecaprenyl pyrophosphate synthase to bind and use analogs of FPP, IPP, or both. 

A "competitive" inhibitor is one that inhibits undecaprenyl pyrophosphate 
synthase activity by binding to the same form of undecaprenyl pyrophosphate 
synthase as its substrate binds-thus directly competing with the substrate for the 
10 active site of undecaprenyl pyrophosphate synthase. Competitive inhibition can be 
reversed completely by sufficiently increasing the substrate concentration. 

An "uncompetitive" inhibitor is one that inhibits undecaprenyl pyrophosphate 
synthase by binding to a different form of the enzyme than does the substrate. Such 
inhibitors bind to undecaprenyl pyrophosphate synthase already bound with the 
15 substrate and not to the free enzyme. Uncompetitive inhibition cannot be reversed 
completely by increasing the substrate concentration. 

A "non-competitive" inhibitor is one that can bind to either the free or substrate 
bound form of undecaprenyl pyrophosphate synthase. 

Those of skill in the art may identify inhibitors as competitive, uncompetitive or 
20 non-competitive by computer fitting enzyme kinetic data using standard equations 
according to Segel, I. H., Enzyme Kinetics, J. Wiley & Sons, (1975). 

The term "homologue" as used herein means a protein, polypeptide, 
oligopeptide, or portion thereof, having preferably at least 80%, more preferably at 
least 90% amino acid sequence identity with Staphylococcus undecaprenyl 
25 pyrophosphate synthase or any functional or structural domain of undecaprenyl 
pyrophosphate synthase. 

The term "co-complex" means undecaprenyl pyrophosphate synthase or a 
mutant or homologue of undecaprenyl pyrophosphate synthase in covalent or non- 
covalent association with a chemical entity or compound. 
30 The term "associating with" refers to a condition of proximity between a 

chemical entity or compound, or portions thereof, and an undecaprenyl 
pyrophosphate synthase molecule or portions thereof. The association may be non- 
covalent, wherein the juxtaposition is energetically favored by hydrogen bonding or 
van der Waals or electrostatic interactions, or it may be covalent. 
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The terms "beta sheet or /?-sheet" refers to the conformation of a polypeptide 
chain stretched into an extended zig-zig conformation. Portions of polypeptide chains 
termed strands that run "parallel" all run in the same direction, amino terminus to 
carboxy terminus. Polypeptide chains or portions thereof, termed strands, that are 
5 "antiparallel" run in the opposite directions. 

The term "binding site" refers to a region of the synthase comprised of amino 
acid residues and optionally cofactors to which a ligand can bind. Undecaprenyl 
pyrophosphate synthase has binding sites for at least farnesyl pyrophosphate and 
longer chain derivatives of FPP, isoprenyl pyrophosphate, magnesium ion, and 
10 sulfate ion. 

The term "active site" refers to any or all of the following sites in undecaprenyl 
pyrophosphate synthase: the FPP binding site, the IPP binding site, the site of the 
synthase reaction products and intermediates, the magnesium ion site, and the 
sulfate site. In one particular usage, "active site" refers to the site where the catalytic 

1 5 reaction occurs. 

The term "atomic coordinates" refers to mathematical coordinates derived 
from mathematical equations related to the patterns obtained on diffraction of a 
monochromatic beam of X-rays by the atoms (scattering centers) of an undecaprenyl 
pyrophosphate synthase molecule in crystal form. The diffraction data are used to 

20 calculate an electron density map of the repeating unit of the crystal. The electron 
density maps are used to establish the positions of the individual atoms within the unit 
cell of the crystal. The similar term "structure coordinates" refers to the mathematical 
coordinates of the individual atoms. 

The term "substantial portion" of atomic coordinates refers to a plurality of at 

25 least twelve atomic coordinates that define or partially define the location of several 
atoms in the synthase or ligand. Preferably, a substantial portion is at least 24 
coordinates. More preferably, a substantial portion is at least 36 coordinates. The 
coordinates can be within the standard deviation. 

The term "heavy atom derivatization" refers to a method of producing a 

30 chemically modified form of a crystal of undecaprenyl pyrophosphate synthase. In 
practice, a crystal is soaked in a solution containing heavy metal atom salts, or 
organometallic compounds, e.g., lead chloride, gold thiomalate, thimerosal or uranyl 
acetate, which can diffuse through the crystal and bind to the surface of the protein. 
The location(s) of the bound heavy metal atom(s) can be determined by X-ray 
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diffraction analysis of the soaked crystal. This information, in turn, is used to 
generate the phase information used to construct three-dimensional structure of the 
enzyme. Blundel, T. L. and N. L. Johnson, Protein Crystallography, Academic Press 
(1976). 

5 Those of skill in the art understand that a set of structure coordinates 

determined by X-ray crystallography is not without standard deviation. For the 
purpose of this invention, any set of structure coordinates for undecaprenyl 
pyrophosphate synthase or undecaprenyl pyrophosphate synthase homologues or 
undecaprenyl pyrophosphate synthase mutants that have a root mean square 
10 deviation of protein backbone atoms (N, C a , C and O) of less than 0.75 A when 
superimposed, using backbone atoms, on the structure coordinates listed in Figure 5 
shall be considered identical. 

The term "unit cell" refers to a basic parallelepiped shaped block. The entire 
volume of a crystal may be constructed by regular assembly of such blocks. Each 
15 unit cell comprises a complete representation of the unit of pattern, the repetition of 
which builds up the crystal. 

The term "space group" refers to the arrangement of symmetry elements of a 

crystal. 

The term "molecular replacement" refers to a method that involves generating 
20 a preliminary model of an undecaprenyl pyrophosphate synthase crystal whose 
structure coordinates are unknown, by orienting and positioning a molecule whose 
structure coordinates are known (e.g., undecaprenyl pyrophosphate synthase 
coordinates from Figure 5) within the unit cell of the unknown crystal so as best to 
account for the observed diffraction pattern of the unknown crystal. Phases can then 
25 be calculated from this model and combined with the observed amplitudes to give an 
approximate Fourier synthesis of the structure whose coordinates are unknown. 
This, in turn, can be subjected to any of the several forms of refinement to provide a 
final, accurate structure of the unknown crystal. Lattman, E., "Use of the Rotation and 
Translation Functions", in Methods in Enzymology, 115, pp. 55-77 (1985); M. G. 
30 Rossmann, ed., "The Molecular Replacement Method", Int. Sci. Rev. Sen, No. 13, 
Gordon & Breach, New York, (1972). Using the structure coordinates of 
undecaprenyl pyrophosphate synthase provided by this invention, molecular 
replacement may be used to determine the structure coordinates of a crystalline 
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mutant or homologue of undecaprenyl pyrophosphate synthase or of a different 
crystal form of undecaprenyl pyrophosphate synthase. 

"Atom type" in, for example, Figure 5, refers to the element whose 
coordinates are measured. The first letter in the column in Figure 5 defines the 
5 element. 

"X, Y, Z" crystallographically define the atomic position of the element 
measured. 

"B" is a thermal factor that measures movement of the atom around its atomic 

center. 

10 Atomic coordinates for undecaprenyl pyrophosphate synthase according to 

Figure 5 may be modified from this original set by mathematical manipulation. Such 
manipulations include, but are not limited to, crystallographic permutations of the raw 
structure coordinates, fractionalization of the raw structure coordinates, integer 
additions or subtractions to sets of the raw structure coordinates, and any 

15 combination of the above. The atomic coordinates of Figure 5 correspond to the 
undecaprenyl pyrophosphate synthase polypeptide chains, and to several molecules 
bound thereto, including magnesium ion, FPP, sulfate, and a plurality of water 
molecules. 

MATERIALS AND METHODS 
20 Cloning and expression 

S. aureus UPS was subcloned into the expression vector pET15b (Novagen). 
Expression was carried out in the E.coli strain BL21IDE3 in M9 minimal media 
containing seleno-L-methionine. Cells were grown overnight in methionine 
supplemented M9 seleno-L-methionine media at 37 degrees Celsius. 50mls of the 
25 overnight were inoculated into one liter of seleno-L-methionine labeling media. The 
cells were grown to an OD 600 of 0.651 at 37degrees Celsius and induced with 50 pM 
IPTG. The cells were allowed to grow for 18 hours at 37 degrees Celsius before 
harvest. A four-liter growth had a final pellet weight of 16.1 grams. 

M9 Seleno-L-methionine labeling media was prepared using the reagents 
30 and amounts shown below. 
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Reagent 


Amount Per Liter 


10X M9 salts* 


100 mis 


Amino Acid Mix** 


100 mis 


20% glucose 


20 mis 


Ampicillin stock (100mgs/ml in H 2 0) 


1 ml 


9.8% MgS0 4 (w/v) 


1 ml 


10 mM CaCI 2 


10 mis 


2 mg/ml Thiamine 


1 ml 


200X Seleno-L-methionine stock*** 


5 mis 



*1X concentration of M9 salts per liter (made up as 10X and autoclaved): 6.0 
gm Na 2 HP0 4 , 3 gm KH 2 P0 4 , 0.5 gm NaCI and 1.0 gm NH 2 CI. 
5 ** Amino acid mix 400 mg/l made up of each of the 19 AA other than 

methionine (D/L mix) made up in 20 mM NaP0 4 buffer, pH 7.0, filter sterilized. 

***200X selono-L-methionine stock made up of 5 mg per ml seleno-L- 
methionine in 50 mM NaP0 4 buffer, pH 7.0 

The components were combined and the final media was filter sterilized 
1 0 through a 0.2 pm filter. 

Methionine supplemented M9 seleno-L-methionine media was prepared 
with identical to reagents as listed above for seleno-L-methionine labeling media 
except that, instead of 5 mis of 200X seleno-L-methionine stock solution per liter, 
2.5 mis of seleno-L-methionine stock and 2.5 mis of L-methionine stock (5 mg per 
15 ml L-methionine in 50 mM NaP0 4 buffer, pH 7.0) were added per liter of media. 
Purification of native S. aureus UPS : 

Cells from a 1 liter shake flask culture of E. coli expressing the (his) 6 -UPS 
fusion protein were lysed in buffer containing 50 mM TrisCI, 0.3 M NaCI, 4 mM b-ME, 
pH 8.0, 1 ug/mL pepstatin, 1 ug/mL leupeptin and 1 mM PMSF, using a 

20 microfluidizer. Lysate was clarified by centrifugation at 40,000 x g. Soluble protein 
was applied to a 5 mL immobilized metal affinity column, Ni-NTA, which had been 
equilibrated in buffer A (50 mM TrisCI, 0.3 M NaCI, 4 mM b-ME, 20 mM imidazole, pH 
8.0 and 1 ug/mL pepstatin and leupeptin). After washing with buffer A + 40 mM 
imidazole, bound protein was step eluted with buffer B (buffer A + 0.25 M imidazole, 

25 pH 8.0). Fractions were analyzed by SDS-PAGE on 10% bis-tris gels in MES buffer 



-16- 

using the NuPAGE system. Fractions containing his-UPS were pooled for further 
purification. 

The (his) 6 tag was removed by digestion with thrombin at a specific thrombin 
recognition site. (His) 6 -UPS was treated with thrombin at a 1:400 thrombin : (his) 6 - 
5 UPS ratio. The reaction was stopped by addition of 1 mM PMSF after 2.5 hrs. at 
room temperature. Chromatography was performed as described above for isolation 
of (his) 6 -UPS. Untagged UPS was isolated in the flowthrough and wash fractions. 
The remaining (his) 6 -UPS and the cleaved (his) 6 peptide were removed in the bound 
fraction. Fractions were analyzed by SDS-PAGE on 10% bis-tris gels in MES buffer 
10 using the NuPAGE system. Fractions containing des-his-UPS were pooled for further 
purification. 

Des-his-UPS was further purified by size exclusion chromatography. The 
protein was concentrated to 10-12 mg/mL and loaded onto a 125 mL Superdex 200 
prep grade column which had been equilibrated in buffer containing 50 mM TrisCI, 

15 0.3 M NaCI, 8 mM DTT, pH 7.5. UPS eluted as a 50 kDa dimer. Fractions were 
analyzed by SDS-PAGE on 10% bis-tris gels in MES buffer using the NuPAGE 
system. The final pool was characterized by dynamic light scattering (DLS), and 
LCMS. DLS revealed that the protein was monodisperse. Mass analysis revealed a 
small level of truncated N-terminus consistent with the loss of 7 amino acids. This 

20 was confirmed by N-terminal sequence analysis. Protein isolated in the manner 
described above was used for crystallization. 

Crystallization of native S. aureus UPS : 

A ternary complex of des-his-UPS: trans-trans-FPP: Mg was made in 50 mM 
TrisCI, 0.3 M NaCI, 8 mM DTT, pH 7.5. Sparse matrix screening using the hanging 
25 drop method was performed. Drops were set at a protein concentration of 5 mg/mL. 
Plates were incubated at 22°C. Two leads were identified, where protein 
concentration was 5-8 mg/ml over reservoir containing either 0.1 M Bicine, pH 9, and 
2-2.4 M (NH 4 ) 2 S0 4 or 0.1 M NaMES, pH 6.5, 0.2 M (NH 4 ) 2 S0 4 , and 30% PEG MME 
5000. 

30 Dendritic crystals obtained by spontaneous nucleation grew from drops where 

the protein: ligand complex was 5 mg/mL over reservoir solution containing 0.1 M 
NaMES, pH 6.5, 0.2 M (NH 4 ) 2 S0 4 , and 30% PEG MME 5000. Seed stocks made 
from these crystals were used to grow bi-pyramid crystals for x-ray diffraction 
experiments. Drops of 2 uL complex + 2 uL reservoir solution were set with reservoir 
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solutions containing 0.1 M NaMES, pH 6.5, 0.2 M (NH 4 ) 2 S0 4 , and 21, 24 and 30% 
PEG MME 5000. These drops were micro-seeded by serial dilution of the seed stock 
through 5 identical drops. These drops equilibrated over the appropriate reservoir 
solution at 22°C. As the seeds diluted out, bi-pyramidal crystals - 250 x 100 microns 
5 grew. 

Purification of selenomethionine labeled S. aureus UPS : 
To solve the structure using the method of MAD-phasing, protein labeled with 
selenium was expressed. Selenomethionine substituted UPS was purified by the 
method outlined for native UPS. Mass analysis of the purified untagged product 
10 revealed 3 components. The primary species observed was consistent with 7 
selenium additions. A minor species was detected consistent with 6 selenium 
additions. An additional minor species was detected consistent with 6 selenium 
additions to the N-terminally truncated species (minus 7 amino acids, one of which is 
a methionine). 

15 Crystallization of selenomethionine labeled S. aureus UPS : 

The condition used for crystallization of the native UPS was used for labeled UPS 
Selenomethionine labeled UPS : t-t-FPP : Mg complex was set in drops as described 
above for the growth of native UPS crystals. Native seeds from dendrites were used 
to cross seed into drops set using the labeled protein. A second round of micro- 

20 seeding was performed using crystals grown from selenomethioine labeled UPS. 
Using this method, bi-pyramidal crystals were grown for MAD data collection using 
synchrotron radiation. Data from these crystals were used to solve the structure 

The sequence of the construct used in crystallization experiments given in 
SEQ ID NO:1. 

25 Data collection 

X-ray diffraction data were collected from flash-frozen crystals at 100K. 
Crystals were briefly soaked in a cryoprotectant solution which consisted of 10% 
MPD (2,4 methyl pentane diol) added to the crystallization reservoir solution. They 
were then introduced into a 100°K cold nitrogen stream. 

30 Crystals are tetragonal, belong to the space group P4 1 2 1 2, with unit cell 

dimensions of a=58.19, b=58.19, c=1 59.26 A. 

Three-wavelength diffraction data to a maximum resolution of 1.83A were 
collected at beamline X12C of the National Synchrotron Light Source at Brookhaven 
National Laboratory, using a Brandeis 2x2 ccd detector. Data were reduced using the 
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HKL suite of programs, and truncated to F's using the program TRUNCATE. Crystal 
information: Space group P4 1 2 1 2: Cell dimensions a=b=58.19 A c=1 59.26 A 
a=b=g=90°. Data statistics are shown in Table 1A below. Values in parentheses are 
for the highest resolution shell. R sym = S(l-<l>)/S<l> 

5 



Crystal 


dmin A 


N un jque 


Redundancy 


<l/s> 


Rsym (%) 


Se-met 11 
(0.9789A) 


1.83 


24,214 


4.4 


14.5(1.8) 


8.7(43.8) 


Se-met 12 
(0.9785 A) 


1.83 


15,892 


12.3 


15.5(2.2) 


8.0(38.9) 


Se-met 13 
(0.9500 A) 


1.83 


16,202 


12.7 


16.8(3.0) 


6.8(33.9) 



Table 1A 

Structure determination 

The synthase structure was solved using the program SOLVE. Se-atom 
10 parameter refinement and MAD phasing were carried out using SHARP, and 
solvent flipped maps were calculated using SOLOMON. Phasing statistics are in 
shown in Table 1B. 





11 Centric 


11 Acentric 


12 centric 


12 acentric 


13 centric 


13 Acentric 


Phasing 
power* 


2.71- 


2.9/2.0 


3.7/- 


3.6/2.0 


-/- 


-/1.8 


D ** 

■^cullis 


42.5/- 


51.7/71.9 


40.7/- 


48.3/71 .2 


-/- 


-/78.5 



15 Table 1B 

*Phasing power (isomorphous/anomalous) = <[|Fh(calc)|/E>, 

**Rcuiij S (isomorphous/anomalous) = E/<|F PH - F P |>, where E is phase-integrated lack of 
closure error and F PH and F P are heavy-atom and protein structure factors 
20 respectively. 
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Figure of Merit (FOM) statistics as a function of resolution in shown in Table 
1 C below. 



Resolution 
(A) 


5.09 


3.63 


2.97 


2.58 


2.31 


2.11 


1.95 


1.83 


Overall 


FOM 

(acentric) 


0.83 


0.73 


0.70 


0.65 


0.55 


0.42 


0.34 


0.32 


0.50 


FOM 
(centric) 


0.71 


0.58 


0.55 


0.52 


0.51 


0.39 


0.32 


0.28 


0.48 



5 Table 1C 

Automated map-tracing, as implemented in ARP/wARP was used to trace the 
map, and the program was able to automatically fit 201 out of 259 residues into the 
1.83A solvent-flipped map. The remaining residues were fit manually, using the 
10 program O. All but the first 18 residues were successfully fit into density. Clear 
electron density was seen for bound FPP and Mg 2+ , as well as for a bound sulfate 
ion. Bound water molecules were automatically picked using ARP. All model 
refinement was carried out using the program REFMAC as implemented in the CCP4 
suite of programs. 

15 The final refined model consists of residues 19-256 of undecaprenyl 

pyrophosphate synthase, FPP, Mg 2+ , sulfate, and 222 bound water molecules. This 
model has an overall R factor of 0.19, with a free R-factor of 0.24, for all data to 
1 .83A. Refinement statistics are in Table 1 D. 
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Resolution 


50.0-1. 83A 


Reflections 


22,639 (all data) 


Rwoik 


19.35% 


Rfree 


23.44% 


RMSD in bond lengths 


0.01 6A 


RMSD in bond angles 


1.74° 


Total atoms (non-hydrogen) 


2204 


Ramachandran Plot Most favoured 


87.9 


Ramachandran Plot Allowed 


12.1 



Table 1D 



Final refined coordinates of S. Aureus UPS are shown in Figure 5. 

5 

The Structure of Staphylococcus Undecaprenvl Pyrophosphate Synthase 
Turning to Figure 1, the overall topology of the synthase is shown 
schematically to consist of a six-stranded parallel jff-sheet, surrounded by six a- 
helices and four 3 10 helices. 

10 Turning to Figure 2a and 2b, analysis of the crystal packing reveals that 

undecaprenyl pyrophosphate synthase exists as a dimer, with two identical subunits 
related by a 2-fold axis of symmetry. Figures 2a and 2b show ribbon drawings of the 
undecaprenyl pyrophosphate synthase dimer. As depicted in the figures, the two 
subunits are intimately associated. The ligand binding sites are at the top of the 

1 5 figure 2a, as indicated. 

Turning to Figure 3, the crystal structure reveals details of the binding sites of 
the substrate FPP and the cofactor Mg 2+ . Figure 3 shows the detailed interactions of 
these molecules with the protein, including interatomic distances. 

The residues from Ser 78 to Val 89 form a coil (specifically a coil-helix-coil, also 

20 termed a loop), which is fully ordered in the disclosure of the instant invention, and 
forms part of the entrance to the catalytic cleft. Arg 84 makes two hydrogen bonds with 
a phosphate group of FPP (see Figure 3). The loop has clearly defined atomic 
coordinates in the structure according to the instant invention. The loops 
corresponding to this coil are disordered, that is, not defined, in the structures of 

25 undecaprenyl pyrophosphate synthase from M. luteus as determined by Fujihashi et 
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al. (2001) and from E. coli as determined by Ko et al. (2001). Neither Fujihashi nor 
Ko included substrate or inhibitor in their crystals. This difference suggests that the 
loop is flexible in the absence of substrate, or other suitable ligand, and the loop will 
occasionally be referred to as "flexible." 

Moreover, as shown in Figure 3, the His 50 , Leu 95 , Phe", Pro 96 , Phe 148 and 
Ala 76 , have hydrophobic interactions with the isoprenoid tail of farnesyl 
pyrophosphate. 

Turning to Figure 4, a sulfate ion is bound to a second ligand binding site 
close to the FPP binding pocket. The second ligand binding site is considered to 
define part of the binding pocket for the second substrate, IPP. 

Thus, the active site is defined as consisting of at least one of the following 
residues: Asp 33 , Gly 34 , Gly 36 , Arg 37 , Arg 46 , Ala 76 , Arg 84 , Leu 95 , Pro 96 , Phe 148 , Arg 201 , 
Arg 207 , and Ser 207 , from the one chain, and Glu 220 and Gly 251 from the other chain, 
denoted chain B. In mutants or homologs of S. aureus undecaprenyl pyrophosphate 
synthase the numbering of amino acid residues can be normalized to the S. aureus 
reference sequence. 
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The S. aureus undecaprenyl pyrophosphate synthase has several structural 
features indicated in Table 1E below. 



His 2 ' -lie 31 


S1 


Asn 38 -Lys 40 


H1 


Arg 46 -lie 67 


H2 


Tyr /1 -Ser' 8 


S2 


Asn 81 -Ser* 3 


H3a 


Glu 86 -Glu 109 


H3b 


Lys 113 -lie 1 " 


S3 


Thr 120 -Lys 122 


H4 


Lys 12 '' -Thr 138 


H5 


Lys 14b -Tyr 152 


S4 


Gly 184 -His 170 


H6 


Ser 1 ' 8 -lie 1 ' 8 


H7 


Glu 181 -Asn 188 


H8 


Leu 198 -Arg 201 


S5 


Glu 220 -Phe 223 


S6 


Trp 228 -Asp 230 


H9 


Glu 233 -Ser 248 


H10 



5 TABLE 1 E 

Undecaprenyl Pyrophosphate Synthase Secondary Structure Assignments 

In the Table 1E, beta-strands are labeled S1-S6 and helices are labeled H1- 
10 H10. The helices H3a, H4, H7 and H9 are 3 10 helices; the others are alpha helices. 
Secondary structures have been calculated according to the method of Kabsch and 
Sander, as implemented in the program Procheck. Other algorithms used to calculate 
secondary structure can produce slightly different assignments. 

The S. aureus undecaprenyl pyrophosphate synthase has several notable 
15 structural features, including the following. The amino acid residues from position 27- 
31 are part of a beta sheet strand termed S1. H1 is an alpha helix immediately 
adjacent to a catalytic aspartic acid in position 33 and also has several FPP binding 
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residues including Gly 34 , Asn 35 , Gly 36 and Arg 37 . The amino acid residues from 
position 46 to 67 form an alpha helix termed H2. The amino acid residues from 
position 71 to 78 are part of a beta sheet strand termed S2. The amino acid residues 
from position 78 to 89, described above as capable of being a flexible loop, are 
5 clearly visible in the structure provided and include Arg 84 that makes two hydrogen 
bonds with a phosphate group of FPP. This sequence also includes a 3 10 helix and 
part of an alpha helix when the substrate is present. Thus, the amino acid residues 
from Asn 81 to Ser 83 from a 3 10 helix termed H3a. 

Moreover, the synthase has other notable features. The amino acids from 

10 Glu 86 to Glu 109 form an alpha helix termed H3b. Of interest, the H3b helix includes a 
proline that may allow flexibility in the structure. The amino acid residues from Lys 113 
to He 117 form a part of a beta sheet termed S3. The amino acid residues from Thr 120 
to Lys 122 form a 3 10 helix termed H4. The amino acids from Lys 125 to Thr 138 form an 
alpha helix termed H5. The amino acids from Lys 145 to Tyr 152 form part of a beta 

15 sheet termed S4. The amino acid residues from Gly 154 to His 170 form an alpha helix 
termed H6. The amino acid residues from Ser 176 to He 178 form a 3 10 helix termed H7. 
The amino acid residues from Glu 181 to Asn 185 form an alpha helix termed H8. The 
amino acid residues from Leu 198 to Ang 201 form a strand of beta sheet termed S5. 
The amino acid residues from Glu 220 to Phe 223 form a strand of beta sheet termed S 6 . 

20 The amino acid residues from Trp 228 to Asp 230 form a 3 i0 helix termed H9. The 
amino acid residues from Glu 233 to Ser 245 form an alpha helix termed H10. 

The crystal structure clearly provides a description of the interaction of the 
amino acid residues of undecaprenyl pyrophosphate synthase with farnesyl 
pyrophosphate and the magnesium ion cofactor. The Asp 33 has a carboxylic acid 

25 functional group in the beta position, the oxygen atom of which interacts with the 
magnesium ion at a distance of about 2.12A. This metal coordination serves to lock 
the magnesium ion into a position to interact with two oxygen atoms of the 
pyrophosphate group of farnesyl pyrophosphate at intermolecular distances of about 
2.02A. 

30 The synthase interaction with FPP is also mediated by other amino acid 

residues. Arg 37 has nitrogen atoms that interact with the oxygen atoms of the 
phosphates in farnesyl pyrophosphate, with nitrogen to oxygen hydrogen bond 
interactions having distances of about 2.38A, about 2.84A and about 2.84A. The 
Gly 36 has an alpha amino group that forms a hydrogen bond with the bridge oxygen 
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of farnesyl pyrophosphate having a distance of about 3.24A between the nitrogen 
and oxygen groups. The Gly 34 has an alpha amino group that also interacts with the 
same oxygen atom as shown in Figure 3 with a nitrogen to oxygen distance of about 
3.31 A. The Arg 46 has two nitrogen atoms in the guanidino functional group that form 
5 hydrogen bonds with an oxygen of the terminal phosphate group in farnesyl 
pyrophosphate and are characterized by nitrogen to oxygen interatomic distances of 
about 2.87 and about 3.17A. The Arg 84 has a guanidino group having two oxygens 
that interact with two oxygen atoms of the phosphate group of farnesyl 
pyrophosphate forming hydrogen bonds with inter-atomic distances of about 2.84 and 
10 about 2.77. 

Deficiencies in Other Studies 

Comparison of the results of the present invention with deductions by others 
illustrates the limits of those studies. In regard to the flexible loop, Ko, et al. 
speculates that the loop is "responsible mainly for IPP binding," and that Trp-75, 

15 corresponding to Trp 82 in S. aureus undecaprenyl pyrophosphate synthase, "is 
involved in FPP binding." The data of the instant invention indicate otherwise. 
Rather, Arg 84 forms a pair of hydrogen bonds to FPP. Ala 76 and Leu 95 , which are 
adjacent to the flexible loop form hydrophobic interactions with the isoprenoid tail of 
FPP, but Trp 82 does not. Ko et al. also states that the glutamate that corresponds to 

20 Glu 88 in S. aureus undecaprenyl pyrophosphate synthase, is critical in substrate 
binding and/or catalysis. As undecaprenyl pyrophosphate synthase has two 
substrates, FPP and IPP, Ko et al.'s statement is ambiguous. The data of the instant 
invention clearly indicate, however, that Glu 88 makes hydrogen bonds to Arg84 and 
Arg46, which in turn stabilize the phosphate group of FPP. Thus Glu88 can be said to 

25 be indirectly involved in substrate binding. 

Fujihashi et al. states that their results indicate that Trp 82 (in the S. aureus 
nomenclature) is the residue binding FPP. In contrast the instant invention shows no 
interaction of Trp 84 with FPP. 

Fujihashi et al. speculates that Arg 201 and Arg 207 (in the S. aureus 

30 nomenclature) could interact with the IPP head group, and that, except for these 
arginines, no conserved residues are found that could bind the pyrophosphate part of 
IPP. In contrast, the measurements of the instant invention clearly show a hydrogen 
bond between the hydroxyl group of a high conserved Ser 209 and the IPP-mimetic 
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sulfate. Moreover, the instant invention has identified a highly conserved Gly 251 , the 
nitrogen of which coordinates with the sulfate ligand. 

One aspect of the invention relates to a composition comprising 
Staphylococcus undecaprenyl pyrophosphate synthase in crystalline form and a 
5 ligand. In general, the ligand can be a substrate, inhibitor, or co-factor. More 
specifically, the ligand can be selected from the group consisting of magnesium ion, 
farnesyl pyrophosphate, isopentyl pyrophosphate, sulfate ion, and any inhibitor that 
binds to a substrate binding site. The inhibitor can be any inhibitor of the synthase, 
including a low affinity or high affinity inhibitor. In one aspect the crystal comprises 
10 ligands, or parts thereof, having atomic coordinates according to Figure 5, or portions 
thereof. 

In another aspect the crystalline undecaprenyl pyrophosphate synthase 
comprises amino acid residues having atomic coordinates according to Figure 5, or a 
substantial portion thereof. The synthase is a dimer of identical polypeptide chains. 

15 In another aspect, the undecaprenyl pyrophosphate synthase comprises an amino 
acid sequence corresponding to residues 1-275 of SEQ ID NO:1. SEQ ID NO:1 is 
the amino acid sequence of S. aureus undecaprenyl pyrophosphate synthase, using 
the three letter amino acid notation. The amino acid denoted number 12 is the first 
amino acid of the wild-type sequence. The amino acids denoted 1-11 denote amino 

20 acids used in the construct. 

The invention also relates to first and second ligand binding sites of 
undecaprenyl pyrophosphate synthase. The first and second ligand binding sites are 
defined by amino acid residues that interact with the polar or ionic head group of the 
ligands and, optionally, with other amino acid residues that interact with a 

25 hydrophobic tail of the ligand. Alternatively, the amino acid residues of the binding 
sites can interact indirectly with the substrate, for example, by binding to a cofactor 
which in turn binds to a substrate, or by binding to another amino acid residue which 
in turn binds to a substrate. 

The first ligand binding site can be defined as comprising at least one amino 

30 acid residue selected from the group consisting of Asp 33 , Gly 34 , Gly 36 , Arg 37 , Arg 46 , 
Ala 76 , Arg 84 , Leu 95 , Pro 96 , and Phe 148 . In a preferred embodiment, the first ligand 
binding site comprises at least three of these amino acid residues. In a yet more 
preferred embodiment, the first ligand binding site comprises at least six of these 
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amino acid residues. In a most preferred embodiment, the first ligand binding site 
comprises all ten amino acid residues. 

The first ligand binding site can alternatively comprise at least about 80% of 
the amino acid residues selected from the group consisting of Asp 33 , Gly 34 , Gly 36 , 
5 Arg 37 , Arg 46 , Ala 76 , Arg 84 , Leu 95 , Pro 96 , and Phe 148 . In a preferred embodiment, the 
first ligand binding site comprises at least about 90% of the amino acid residues 
selected from the group consisting of Asp 33 , Gly 34 , Gly 36 , Arg 37 , Arg 46 , Ala 76 , Arg 84 , 
Leu 95 , Pro 96 , and Phe 148 . 

The second ligand binding site can be defined as comprising at least one 

10 amino acid residue selected from the group consisting of Asp 33 , Arg 201 , Arg 207 , and 
Ser 209 from one chain (A) of the dimer, and Glu 220 and Gly 251 from the other chain (B) 
of the dimer. In a preferred embodiment, the second binding site comprises at least 
three of these amino acid residues. In a more preferred embodiment, the second 
binding site comprises all of these amino acid residues. 

15 The second ligand binding site can alternatively comprise at least about 80% 

of the amino acid residues selected from the group consisting of Asp 33 , Arg 201 , Arg 207 , 
Ser 209 , Glu 220 ^), and Gly 251 ^). In a preferred embodiment, the second ligand 
binding site comprises at least about 90% of the amino acid residues selected from 
the group consisting of Asp 33 , Arg 201 , Arg 207 , Ser 209 , Glu 220 (B), and Gly 251 (B). 

20 Another aspect of the invention relates to a method of designing or identifying 

a potential ligand for an undecaprenyl pyrophosphate synthase, the method 
comprising using a three-dimensional structure including atomic coordinates of amino 
acid residues 33, 34, 36, 46, 76, 84, 95, 96, and 148, according to Figure 5. In a 
preferred embodiment, the coordinates are those of S. aureus undecaprenyl 

25 pyrophosphate synthase, or a substantial portion thereof. The method can include 
obtaining the potential ligand which can include synthesizing the ligand in whole or in 
part, borrowing the ligand, and purchasing the ligand. 

In one aspect the invention is directed to a computational model of a 
composition comprising an undecaprenyl pyrophosphate synthase having atomic 

30 coordinates of S. aureus undecaprenyl pyrophosphate synthase, or a portion thereof, 
and a computer program running on a computer addressing the atomic coordinates. 
The atomic coordinates can be those of Figure 5, or a substantial portion thereof. 

In another aspect, the invention is directed to a method of designing or 
identifying a ligand or a potential inhibitor of a second undecaprenyl pyrophosphate 
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synthase comprising: (a) using a three-dimensional structure of a first undecaprenyl 
pyrophosphate synthase, as defined by atomic coordinates according to Figure 5, or 
a substantial portion thereof; (b) identifying at least one first amino acid residue 
having a first peptide backbone and the amino acid residue(s) defining, in part, in at 
5 least one ligand binding site; (c) employing protein alignment means to identify in the 
second undecaprenyl pyrophosphate synthase at least one second amino acid 
residue having a second peptide backbone that is capable of substantially aligning 
with the first backbone; (d) employing the three-dimensional structure to design or 
select the potential ligand for the second undecaprenyl pyrophosphate synthase; (e) 

10 synthesizing the potential ligand; and (f) contacting the potential ligand with the 
second undecaprenyl pyrophosphate synthase to determine binding to the second 
undecaprenyl pyrophosphate synthase; wherein the second amino acid residue 
differs from the first amino acid residue. 

In yet another aspect, the invention is directed to a computational model of an 

15 active site of an isolated undecaprenyl pyrophosphate synthase comprising a 
magnesium ion cofactor and a polypeptide comprising a first arginine residue having 
a guanidino group having nitrogen atoms, and an aspartic acid residue comprising 
oxygen atoms forming an acid functional group, wherein the oxygen atoms 
coordinate with the cofactor and at least one nitrogen atom of the guanidino group of 

20 the first arginine residue; and a second arginine residue in a polypeptide loop 
comprising the sequence Glu Asn Trp Xaa Arg Pro (SEQ ID NO:2). The Arg has at 
least one nitrogen atom capable of coordinating an atom of a ligand. Xaa is any 
amino acid residue, including hydrophilic, hydrophobic, and ionic amino acid 
residues. The cofactor, aspartic acid residue, first arginine residue, and second 

25 arginine residue form at least a part of an active site of an undecaprenyl 
pyrophosphate synthase. In one embodiment Xaa is Ser. 

Moreover, the active site can further comprise a third arginine in an alpha- 
helix comprising the sequence Asp Gly Asn Xaa Arg (SEQ ID NO:3), the Arg having 
at least two nitrogen atoms capable of coordinating an atom of a ligand. Xaa in SEQ 

30 ID NO:3 is any amino acid residue, including hydrophilic, hydrophobic, and ionic 
amino acid residues. In one embodiment, Xaa is Gly. 

The invention is also directed to a computational model of an active site 
comprising a representation of the active site of undecaprenyl pyrophosphate 
synthase by a computer program capable of running on a computer. 
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In one aspect, the invention is directed to a computational model of a 
composition comprising an undecaprenyl pyrophosphate synthase having at least 
twelve of the atomic coordinates of S. aureus undecaprenyl pyrophosphate synthase 
and a computer program running on a computer addressing the atomic coordinates. 
5 Preferably, the model comprises at least twenty-four, more preferably at least 36 
atomic coordinates, and most preferably at least 48 atomic coordinates. 

The computational model can further comprise an amino acid residue 
sequence Asp Gly Asn Gly Arg Trp (SEQ ID NO:4), the Arg having at least one 
nitrogen atom, and a ligand comprising at least one oxygen atom, wherein the at 
10 least one nitrogen atom abuts the oxygen atom by about 2.4A. 

In another embodiment, the computational model can further comprise an 
amino acid residue sequence Asp Gly Asn Gly Arg Trp (SEQ ID NO:4), each Gly 
having a nitrogen atom, and a ligand comprising at least one oxygen atom, wherein 
the nitrogen atom abuts the oxygen by about 3.3A. 
15 In yet another embodiment, the computational model can alternatively further 

comprise an amino acid residue sequence Glu Asn Trp Ser Arg Pro (SEQ ID NO:5), 
the Arg having at least one nitrogen atom, and a ligand comprising at least one 
oxygen atom, wherein the nitrogen atom abuts the oxygen atom by about 2.9A. 

In still another embodiment, the computational model can alternatively further 
20 comprise an amino acid residue sequence Pro Arg Me Lys Gly His (SEQ ID NO:6), 
the Arg having at least one nitrogen atom, and a ligand comprising at least one 
oxygen atom, wherein the nitrogen atom abuts the oxygen atom by about 3A. 

Also, the computational model can alternatively further comprise an amino 
acid residue sequence Arg Tyr Ser Asn Phe Leu (SEQ ID NO:7), the Ser having one 
25 nitrogen atom, and a ligand having at least one oxygen atom wherein the nitrogen 
atom abuts the oxygen atom by about 2.6 angstroms. 

Design of Undecaprenyl pyrophosphate synthase inhibitors 
One of skill in the art of molecular modeling can use any method to screen 
chemical moieties for the ability to associate with undecaprenyl pyrophosphate 
30 synthase or with the Mg 2 \ FPP, or IPP binding sites that comprise part of the 
undecaprenyl pyrophosphate synthase active site. Visual inspection of a model of 
the ligand binding sites based on the undecaprenyl pyrophosphate synthase 
coordinates in Figure 5 can lead to candidate chemical entities. Selected chemical 
moieties can then be positioned in orientations within one of the ligand binding sites 
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of undecaprenyl pyrophosphate synthase. Positioning can be accomplished using 
software such as Quanta and Sybyl and is useful for changing the positions of 
chemical entities. Then standard molecular mechanics forcefields, such as 
CHARMM and AMBER can be used to minimize the energy and molecular kinetics of 
5 binding. 

Other computer programs useful in selecting chemical moieties include: 

1. DOCK (Kuntz, I. D. et al., "A Geometric Approach to Macromolecule- 

Ligand Interactions", J. Mol. Biol., 161, pp. 269-288 (1982)). DOCK is available from 

University of California, San Francisco, Calif. 
10 2. GRID (Goodford, P. J., "A Computational Procedure for Determining 

Energetically Favorable Binding Sites on Biologically Important Macromolecules", J. 

Med. Chem., 28, pp. 849-857 (1985)). GRID is available from Oxford University, 

Oxford, UK. 

3. AUTODOCK (Goodsell, D. S. and A. J. Olsen, "Automated Docking of 
15 Substrates to Proteins by Simulated Annealing", Proteins: Structure. Function, and 

Genetics, 8, pp. 195-202 (1990)). AUTODOCK is available from Scripps Research 
Institute, La Jolla, Calif. 

4. MCSS (Miranker, A. and M. Karplus, "Functionality Maps of Binding 
Sites: A Multiple Copy Simultaneous Search Method." Proteins: Structure. Function 

20 and Genetics, 11, pp. 29-34 (1991)). MCSS is available from Molecular Simulations, 
Burlington, Mass. 

Selected moieties can be assembled into a single compound by initial visual 
review of the organization of the parts to make a whole in relation to the atomic 
coordinates of undecaprenyl pyrophosphate synthase. Model building with software 
25 such as Quanta or Sybyl can supplement the process. 

Other programs useful in building chemical moieties into a ligand or inhibitor 

include: 

1. CAVEAT (Bartlett, P. A. et al, "CAVEAT: A Program to Facilitate the 
Structure-Derived Design of Biologically Active Molecules". In "Molecular 
30 Recognition in Chemical and Biological Problems", Special Pub., Royal Chem. Soc, 
78, pp. 182-196 (1989)). CAVEAT is available from the University of California, 
Berkeley, Calif. 
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2. 3D Database systems such as MACCS-3D (MDL Information 
Systems, San Leandro, Calif.). See also, Martin, Y. C, "3D Database Searching in 
Drug Design", J. Med. Chern., 35, pp. 2145-2154 (1992)). 

3. HOOK (available from Molecular Simulations, Burlington, Mass.). 

5 An undecaprenyl pyrophosphate synthase inhibitor or ligand can be prepared 

one moiety at a time, as described. Moreover, inhibitory or other undecaprenyl 
pyrophosphate synthase binding compounds can be designed "de novo" using either 
a vacant binding site or with moieties of a known inhibitor. Computer programs that 
support this approach include: 

10 1. LEGEND (Nishibata, Y. and A. Itai, Tetrahedron, 47, p. 8985 (1991)). 

LEGEND is available from Molecular Simulations, Burlington, Mass. 

2. LUDI (Bohm, H.-J., "The Computer Program LUDI: A New Method for 
the De Novo Design of Enzyme Inhibitors", J. Comp. Aid. Molec. Design, 6, pp. 61-78 
(1992)). LUDI is available from Biosym Technologies, San Diego, Calif. 

15 3. LeapFrog (available from Tripos Associates, St. Louis, Mo.). 

Variations on molecular modeling can be useful in this invention and include: 
Cohen, N. C. et al., "Molecular Modeling Software and Methods for Medicinal 
Chemistry", J. Med. Chem., 33, pp. 883-894 (1990); and Navia, M. A. and M. A. 
Murcko, "The Use of Structural Information in Drug Design", Current Opinions in 

20 Structural Biology, 2, pp. 202-210 (1992). 

The efficiency of a model ligand binding to undecaprenyl pyrophosphate 
synthase can be evaluated and optimized by computation. For example, an effective 
undecaprenyl pyrophosphate synthase inhibitor can induce a relatively small 
deformation upon binding, that is, the energy in the bound and free states would be 

25 similar. Thus, in one embodiment undecaprenyl pyrophosphate synthase inhibitors 
should preferably have a deformation energy upon binding of about 8 kcal/mole or 
less. In the case where undecaprenyl pyrophosphate synthase inhibitors can bind to 
the synthase in more than one conformation the deformation binding energy is the 
difference between the average energy of the bound conformations less the energy in 

30 free solution. Further enhancement of binding can be achieved by computational 
repulsive charge interaction between the ligand and the synthase. In a similar 
manner, dipole-dipole interactions can be reduced. Advantageously, the net dipole- 
dipole and charge interactions between ligand and undecaprenyl pyrophosphate 
synthase favor binding. 
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Computer software useful to evaluate energies of deformation and of 
electrostatic repulsion and attraction include: Gaussian 92, revision C, M. J. Frisch, 
Gaussian, Inc., Pittsburgh, Pa. ©1992; AMBER, version 4.0, P. A. Kollman, University 
of California at San Francisco, ©1994; QUANTA/CHARMM, Molecular Simulations, 
5 Inc., Burlington, Mass. ©1994; and Insight ll/Discover (Biosysm Technologies Inc., 
San Diego, Calif. ©1994). These applications can be used on suitable workstations. 
Other hardware systems and software packages will be known to those skilled in the 
art. 

A model undecaprenyl pyrophosphate synthase-binding compound can then 

10 be modified by changing functional groups to improve binding or inhibitory properties. 
The modified group can be similar to the size, volume and distribution of polar and 
hydrophobic functional groups as the model compound or it can differ. Modified 
compounds can be analyzed for fit to undecaprenyl pyrophosphate synthase by the 
computer modeling methods described above. 

15 One aspect of the invention comprises a method of identifying an inhibitor 

capable of binding to and inhibiting the enzymatic activity of an undecaprenyl 
pyrophosphate synthase, comprising: (a) introducing into a suitable computer 
program information defining the binding site of the undecaprenyl pyrophosphate 
synthase comprising first atomic coordinates of amino acids capable of binding to a 

20 substrate, wherein the program displays the three-dimensional structure thereof; (b) 
creating a three dimensional model of a test compound in the computer program; (c) 
displaying and superimposing the model of the test compound on the structure of the 
active site; (d) assessing whether the test compound model fits spatially into the 
active site; (e) incorporating the test compound in a biological synthase activity assay; 

25 and (f) determining whether the test compound inhibits enzymatic activity in the 
assay. 

The method can further comprise introducing into the computer program 
second atomic coordinates of water molecules bound to the substrate. Thereby, the 
free energy of binding of the potential inhibitor can include displacement of bound 
30 water. 

In one embodiment, the method comprises introducing into the computer 
program an amino acid residue sequence of the synthase, or portion thereof. In one 
preferred embodiment, the method comprises introducing into the computer program 
third atomic coordinates of a first 3 10 helix of the synthase, comprising the sequence 
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Asn-Trp-Ser. In another embodiment, the method further comprises introducing into 
the computer program fourth atomic coordinates of at least one synthase structural 
element selected from the group consisting of an alpha helix, a second 3 10 helix, a 
strand of beta sheet, and a coil. 
5 In yet another embodiment of the method, the undecaprenyl pyrophosphate 

synthase structural elements consist essentially of a coil and (a) a first beta sheet 
strand consisting of His He Ala He He (SEQ ID NO:8), or homolog thereof, and a 
second coil; (b) a first alpha helix consisting of Asn Gly Arg Trp Ala Lys (SEQ ID 
NO:9), or homolog thereof, and a third coil; (c) a second alpha helix consisting of Arg 

10 lie Lys Gly His Tyr Glu Gly Met Gin Thr He Lys Lys He Thr Arg Val Ala Ser Asp He 
(SEQ ID NO: 10), or homolog thereof, and a fourth coil; (d) a second beta sheet 
strand consisting of Tyr Leu Thr Leu Tyr Ala Phe Ser (SEQ ID NO:11), or homolog 
thereof, and a fifth coil; (e) a first 3 10 helix consisting of Asn Trp Ser, and a sixth coil; 
(f) a third alpha helix consisting of Glu Ser Glu Val Asn Tyr lie Met Asn Leu Pro Val 

15 Asn Phe Leu Lys Thr Phe Leu Pro Glu Leu He Glu (SEQ ID NO: 12), or homolog 
thereof, and a seventh coil; (g) a third beta sheet strand consisting of Lys Val Glu Thr 
He (SEQ ID NO: 13), or homolog thereof, and an eighth coil; (i) a second 3 10 helix 
consisting of Thr Asp Lys, and a ninth coil; G) a fourth alpha helix consisting of Lys 
Ser Thr He Glu Ala He Asn Asn Ala Lys Glu Lys Thr (SEQ ID NO: 14), or homolog 

20 thereof, and a tenth coil; (k) a fourth beta sheet strand consisting of Lys Leu He Phe 
Ala lie Asn Tyr (SEQ ID NO: 15), or homolog thereof, and an eleventh coil; (I) a fifth 
alpha helix consisting of Gly Arg Ala Glu Leu Val His Ser He Lys Asn Met Phe Asp 
Glu Leu His (SEQ ID NO: 16), or homolog thereof, and a twelfth coil; (m) a third 3 10 
helix consisting of Ser Asp lie, and a thirteenth coil; (n) a sixth alpha helix consisting 

25 of Glu Thr Tyr He Asn (SEQ ID NO:17), or homolog thereof, and a fourteenth coil; (o) 
a fifth beta sheet strand consisting of Leu Leu He Arg (SEQ ID NO: 18), or homolog 
thereof, and a fifteenth coil; (p) a sixth beta sheet strand consisting of Glu Phe lie Phe 
(SEQ ID NO: 19), or homolog thereof, and a sixteenth coil; (q) a fourth 3i 0 helix 
consisting of Trp Pro Asp, and a seventeenth coil; and/or (r) a seventh alpha helix 

30 consisting of Glu Asp Glu Leu He Lys Cys lie Lys He Tyr Gin Ser (SEQ ID NO:20), or 
homolog thereof, and an eighteenth coil. 

In still another embodiment of the method, the second coil is connected to the 
first alpha helix, the third coil is connected to the second alpha helix, the fourth coil is 
connected to the second beta sheet strand, the fifth coil is connected to the first 3 10 
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helix, the sixth coil is connected to the third alpha helix, the seventh coil is connected 
to the third beta sheet strand, the eighth coil is connected to the second 3 10 helix, the 
ninth coil is connected to the fourth alpha helix, the tenth coil is connected to the 
fourth beta sheet strand, the eleventh coil is connected to the fifth alpha helix, the 
5 twelfth coil is connected to the third 3 10 helix, the thirteenth coil is connected to the 
sixth alpha helix, the fourteenth coil is connected to the fifth beta sheet strand, the 
fifteenth coil is connected to the sixth beta sheet strand, the sixteenth coil is 
connected to the fourth 3 10 helix, and/or the seventeenth coil is connected to the 
seventh alpha helix. In this description, the numerical adjectives, first, second and so 

10 forth, do not necessarily indicate a temporal or spatial order, but rather, serve merely 
to distinguish otherwise similarly named elements from one another. 

Knowledge of the three-dimensional structure allows solution, by the method 
of molecular replacement, of crystal structures of undecaprenyl pyrophosphate 
synthase bound to inhibitors, and use of the method of difference Fourier analysis to 

15 determine the bound conformation of the inhibitors. Knowledge of the bound 
conformation then allows for the design of inhibitors with better properties. 

Knowledge of the three-dimensional structure allows the user to solve, by the 
method of molecular replacement, the structure of undecaprenyl pyrophosphate 
synthase from any other organism. 

20 Knowledge of the three-dimensional structure allows the user to solve, by the 

method of molecular replacement, the structures of undecaprenyl pyrophosphate 
synthase mutants which may be used as probes of undecaprenyl pyrophosphate 
synthase activity. 

25 EXAMPLES 

The following non-limiting examples are presented to further illustrate the 
invention. 

Example 1 
Design of an Inhibitor 

30 The atomic coordinates of the polypeptide chains of S. aureus undecaprenyl 

pyrophosphate synthase, as identified in Figure 5, can be used in a computer to 
construct a three-dimensional model of the active site. A putative competitive 
inhibitor can be fit into a binding site on the enzyme. One such putative inhibitor is 
(2Z,6E,10E)-4-methyl-geranylgeranyl diphosphate. Ohnuma et al., 1989 Kinetic 
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studies on the prenyl chain elongation by undecaprenyl pyrophosphate synthase with 
artificial substrate homologues. 271 FEBS Lett 257 71. Modifications in the putative 
inhibitor can be made to prepare a virtual library of structurally related compounds. A 
docking program can then be used to evaluate interaction of each compound with the 
5 synthase, and to compare and rank the relative binding of the compounds to the 
synthase. 

Compounds that appear to have relatively high affinity for the synthase can be 
obtained or synthesized and evaluated in a biochemical or biological assay. A 
suitable biological assay can be a measurement of growth by, for example, changes 
10 in turbidity of a bacterial suspension culture. 

Example 2 

Use of an inhibitor of undecaprenyl pyrophosphate synthase activity 

to identify novel ligands 
Novel ligands capable of binding to an undecaprenyl pyrophosphate synthase 
15 substrate binding site can be identified by using a known inhibitor, for example, (S)- 
farnesyl thiopyrophosphate, or a substrate, for example, farnesyl pyrophosphate. 
Useful substrates of the synthase in addition to isoprenyl pyrophosphate (C 5 PP) and 
farnesyl pyrophosphate (C 15 PP) include C 20 PP, C 25 PP, C 30 PP, C35PP, C 40 PP, C 45 PP, 
and C 50 PP, where the subscript denotes the number of carbon atoms in the 
20 isoprenoid chain. Properties of (S)-farnesyl thiopyrophosphate are described by 
Chen et al. Chen YH et al. Probing the conformational change of Escherichia coli 
undecaprenyl pyrophosphate synthase during catalysis using an inhibitor and 
tryptophan mutants. 277 J.Biol.Chem. 7369 (2002). 

The atomic features of the known inhibitors or substrates are introduced into a 
25 suitable computer program that has information defining the substrate binding site. 
Typically, the information includes atomic coordinates of those amino acids that can 
bind to a known synthase substrate, such as are identified in Figure 5. The computer 
program can then display the three-dimensional structure of the binding site. Then a 
three-dimensional model of a test compound can be created in the computer 
30 program. 

A docking program can be used to dock the model of the test compound to 
the structure of the binding site. That is, the program fits the test molecule into the 
binding site, allowing for rotation of the bonds of the molecule to test the several 
conformation of the test molecule, and evaluates the quality of fit. Similarly, a three 
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dimensional model of the substrate or of an inhibitor of the synthase can be created 
and docking information obtained. Then the docking parameters of the test 
compound can be compared to those of the substrate or of the known inhibitor. The 
docking program can then provide an output which can rank order the association 
5 parameters of each test or comparison molecule to the synthase. 

In consequence, candidate compounds most likely to have high affinity for the 
binding site can be readily identified. Synthesis of the most potent test molecules, or 
otherwise obtaining them, can provide physical molecules for biochemical or 
biological analysis. 

10 The method can optionally include introducing the atomic coordinates of those 

water molecules bound to the substrate, such that the coordinates are available to 
the computer program. Optionally, one skilled in the art can introduce into the 
computer program the atomic coordinates of at least one synthase structural element. 
Exemplary structural elements are an alpha helix, a 3 10 helix, a strand of beta sheet, 

1 5 and a coil. The 3 10 helix can have the sequence Asn Trp Ser, a part of a polypeptide 
loop that can engage the substrate. 

The method can optionally also include incorporating the test compound in a 
biochemical synthase activity assay for a synthase; and then determining whether the 
test compound inhibits synthase activity in the assay. Suitable inhibitors can be 

20 further assessed in cell permeability studies, viability studies, and bacteremia studies, 
for example by biological assays. 

Example 3 

Undecaprenyl pyrophosphate synthase assay 
Undecaprenyl pyrophosphate synthase activity can be determined by 
25 standard methods. Measurement of synthase activity in vitro in the absence and 
presence of putative inhibitors can yield information on direct effects on the synthase. 
By comparison, measurements using viable bacteria in the absence and presence of 
putative inhibitors can yield information, when compared with in vitro analyses, of cell 
permeation. One skilled in the art will recognize that undecaprenyl pyrophosphate 
30 synthase substrates and close analogs thereof will be substantially cell impermeant 
under normal conditions. 

One suitable method of in vitro analysis follows. [ 14 C]-IPP (55 mCi/mmol) is 
incubated for up to 20 min at 25°C in the presence of IPP (2-400 //M), FPP (0.2-10 
jwM), synthase (0.01-0.1 /yM) in a suitable buffered solution. A suitable buffered 
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solution is 0.1% Triton X-100 in 50 mM KCI, 0.5 mM MgCI 2 ,100mM KOH-HEPES, pH 
7.5. To measure a rate, aliquots of the reaction mixture are removed at timed 
intervals and mixed with a solution of 10 mM EDTA to stop the reaction. The reaction 
products are extracted with 1-butanol, the phases separated, and the radioactive 
5 materials measured by scintillation counting. The butanol phase, which contains the 
undecaprenyl pyrophosphate, can be evaporated, and the pyrophosphate groups 
hydrolyzed in a solution of 20% propanol containing 4.4 units/ml acid phosphatase, 
0.1% Triton X-100, and 50 mM sodium acetate, pH 4.7. The resultant polyprenols 
are extracted with 1-hexane and spotted on a reversed-phase TLC plate and 
10 developed using acetone/water (19:1) as the mobile phase. The TLC plates are then 
analyzed by autoradiography. 

One skilled in the art can measure synthase activity in vitro using the assay 
described above in the absence and presence of putative inhibitors. 

15 Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more 
than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by 
the following claims. 

20 Unless otherwise indicated, all numbers expressing quantities of ingredients, 

reaction conditions, and so forth used in the specification and claims are to be 
understood as being modified in all instances by the term "about." Accordingly, 
unless indicated to the contrary, the numerical parameters set forth in this 
specification and attached claims are approximations that may vary depending upon 

25 the desired properties sought to be obtained by the present invention. 

All publications and patents mentioned herein, including those items listed 
below, are hereby incorporated by reference in their entirety as if each individual 
publication or patent was specifically and individually indicated to be incorporated by 
reference. In case of conflict, the present application, including any definitions herein, 

30 will control. Also incorporated by reference in their entirety are any polynucleotide 
and polypeptide sequences which reference an accession number correlating to an 
entry in a public database, such as those maintained by The Institute for Genomic 
Research (TIGR) and/or the National Center for Biotechnology Information (NCBI). 



